Literature Survey: Study of Reordering in Pivot Based SMT
Abstract
Pivot-based SMT addresses the scarcity of source-target parallel corpora by introducing a third, resource-rich ‘pivot’ language. The triangulation method in pivot-based SMT uses the pivot language to induce new phrase pairs into the phrase table; this process is known as ‘Phrase Table Triangulation’ and has been studied extensively. This paper surveys past work in pivot-based SMT, specifically the triangulation method, and discusses in detail the work done on pivot-based SMT at IIT Bombay. It also surveys work in other areas that are important for pivot-based SMT, such as system combination techniques, domain adaptation, and various reordering models.
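As an illustration of how triangulation induces new phrase pairs, the following is a minimal sketch, not the survey's own implementation. It assumes the commonly used formulation in which a source-target translation probability is obtained by summing, over shared pivot phrases, the product of the source-pivot and pivot-target probabilities. The function name `triangulate`, the nested-dictionary table layout, and the toy phrases are all hypothetical.

```python
from collections import defaultdict


def triangulate(src_pivot, pivot_tgt):
    """Induce source-target phrase pairs through shared pivot phrases.

    src_pivot: {source phrase: {pivot phrase: p(pivot | source)}}
    pivot_tgt: {pivot phrase: {target phrase: p(target | pivot)}}
    Returns:   {source phrase: {target phrase: p(target | source)}}

    Assumed formulation: p(t | s) = sum over pivot phrases p of
    p(t | p) * p(p | s).
    """
    src_tgt = defaultdict(lambda: defaultdict(float))
    for src, pivots in src_pivot.items():
        for pivot, p_pivot_given_src in pivots.items():
            # Pivot phrases missing from the pivot-target table
            # contribute nothing to the induced table.
            for tgt, p_tgt_given_pivot in pivot_tgt.get(pivot, {}).items():
                src_tgt[src][tgt] += p_pivot_given_src * p_tgt_given_pivot
    return {src: dict(tgts) for src, tgts in src_tgt.items()}


# Toy, hypothetical phrase tables (French -> English -> German).
src_pivot = {"maison": {"house": 0.75, "home": 0.25}}
pivot_tgt = {
    "house": {"Haus": 1.0},
    "home": {"Haus": 0.5, "Heim": 0.5},
}

induced = triangulate(src_pivot, pivot_tgt)
print(induced)
```

In this toy case the pair ("maison", "Heim") never appears in a direct source-target corpus; it is induced only because both phrases align to the pivot phrase "home". A real system would also triangulate the inverse probabilities and lexical weights, which this sketch omits.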
Related resources
Triangulation of Reordering Tables: An Advancement Over Phrase Table Triangulation in Pivot-Based SMT
Triangulation in Pivot-Based Statistical Machine Translation (SMT) is a very effective method for building Machine Translation (MT) systems in case of scarcity of the parallel corpus. Phrase Table Triangulation helps in such a resource constrained setting by inducing new phrase pairs with the help of a pivot. However, it does not explore the possibility of extracting reordering information throug...
The TALP&I2r SMT systems for IWSLT 2008
This paper gives a description of the statistical machine translation (SMT) systems developed at the TALP Research Center of the UPC (Universitat Politècnica de Catalunya) for our participation in the IWSLT’08 evaluation campaign. We present Ngram-based (TALPtuples) and phrase-based (TALPphrases) SMT systems. The paper explains the 2008 systems’ architecture and outlines translation schemes we ...
Translation Quality Indicators for Pivot-based Statistical MT
Recent research on multilingual statistical machine translation focuses on the usage of pivot languages in order to overcome resource limitations for certain language pairs. This paper provides new insights into what factors make a good pivot language and investigates the impact of these factors on the overall pivot translation performance. Pivot-based SMT experiments translating between 22 Ind...
A Hybrid Machine Translation System Based on a Monotone Decoder
In this paper, a hybrid Machine Translation (MT) system is proposed by combining the result of a rule-based machine translation (RBMT) system with a statistical approach. The RBMT uses a set of linguistic rules for translation, which leads to better translation results in terms of word ordering and syntactic structure. On the other hand, SMT works better in lexical choice. Therefore, in our sys...
Phrase Reordering Model Integrating Syntactic Knowledge for SMT
The reordering model is important for statistical machine translation (SMT). Current phrase-based SMT technologies are good at capturing local reordering but not global reordering. This paper introduces syntactic knowledge to improve the global reordering capability of an SMT system. Syntactic knowledge such as boundary words, POS information and dependencies is used to guide phrase reordering. Not on...